# GRPO reinforcement learning
Reasongen R1 SFT
Apache-2.0
ReasonGen-R1 is a text-to-image model trained on image prompts and reasoning basis datasets through supervised fine-tuning (SFT), with the explicit 'thinking' ability based on text.
Text-to-Image
Transformers

R
Franklin0
312
1
Gazal R1 32B GRPO Preview
Apache-2.0
Gazal - R1 - 32B is a language model specifically designed for medical reasoning and clinical decision - making. It is built on Qwen 3 32B and demonstrates excellent performance in the professional medical field.
Large Language Model
Transformers

G
TachyHealth
116
1
Seg Zero 7B Best On ReasonSegTest
Other
Seg-Zero-7B is an image segmentation model based on reasoning chain guidance, featuring a decoupled architecture that includes a reasoning model and a segmentation model. It achieves zero-shot generalization capabilities through GRPO reinforcement learning training.
Image Segmentation
Transformers English

S
Ricky06662
724
0
Qwen2.5 0.5B Instruct Gensyn Swarm Peaceful Exotic Butterfly
A fine-tuned version based on Gensyn/Qwen2.5-0.5B-Instruct, trained using the TRL framework and GRPO algorithm, suitable for instruction-following tasks.
Large Language Model
Transformers

Q
juliannode
16
2
Captain Eris Violet GRPO V0.420
Other
Captain-Eris_Violet is an advanced language model developed through multi-stage supervised fine-tuning, QLoRA adapters, and GRPO-optimized RLHF, suitable for role-playing and dialogue generation.
Large Language Model
Transformers English

C
Nitral-AI
1,355
21
Featured Recommended AI Models